Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 804 |
| Missing cells | 18 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 106.9 KiB |
| Average record size in memory | 136.2 B |
Variable types
| NUM | 9 |
|---|---|
| BOOL | 4 |
| CAT | 3 |
| UNSUPPORTED | 1 |
Reproduction
| Analysis started | 2020-08-30 06:46:03.359760 |
|---|---|
| Analysis finished | 2020-08-30 06:46:19.777671 |
| Duration | 16.42 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
ID has unique values | Unique |
Education_loan is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Emp_duration has 30 (3.7%) zeros | Zeros |
| Distinct count | 804 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 502.5 |
|---|---|
| Minimum | 101 |
| Maximum | 904 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.3 KiB |
Quantile statistics
| Minimum | 101 |
|---|---|
| 5-th percentile | 141.15 |
| Q1 | 301.75 |
| median | 502.5 |
| Q3 | 703.25 |
| 95-th percentile | 863.85 |
| Maximum | 904 |
| Range | 803 |
| Interquartile range (IQR) | 401.5 |
Descriptive statistics
| Standard deviation | 232.2391009 |
|---|---|
| Coefficient of variation (CV) | 0.462167365 |
| Kurtosis | -1.2 |
| Mean | 502.5 |
| Median Absolute Deviation (MAD) | 201 |
| Skewness | 0 |
| Sum | 404010 |
| Variance | 53935 |
| Value | Count | Frequency (%) | |
| 904 | 1 | 0.1% | |
| 364 | 1 | 0.1% | |
| 374 | 1 | 0.1% | |
| 373 | 1 | 0.1% | |
| 372 | 1 | 0.1% | |
| 371 | 1 | 0.1% | |
| 370 | 1 | 0.1% | |
| 369 | 1 | 0.1% | |
| 368 | 1 | 0.1% | |
| 367 | 1 | 0.1% | |
| Other values (794) | 794 | 98.8% |
| Value | Count | Frequency (%) | |
| 101 | 1 | 0.1% | |
| 102 | 1 | 0.1% | |
| 103 | 1 | 0.1% | |
| 104 | 1 | 0.1% | |
| 105 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 904 | 1 | 0.1% | |
| 903 | 1 | 0.1% | |
| 902 | 1 | 0.1% | |
| 901 | 1 | 0.1% | |
| 900 | 1 | 0.1% |
Default
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Memory size | 6.3 KiB |
| No | |
|---|---|
| Yes | |
| (Missing) | 1 |
| Value | Count | Frequency (%) | |
| No | 568 | 70.6% | |
| Yes | 235 | 29.2% | |
| (Missing) | 1 | 0.1% |
Checking_amount
Real number (ℝ)
| Distinct count | 584 |
|---|---|
| Unique (%) | 72.7% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 370.07098381070983 |
|---|---|
| Minimum | -436.0 |
| Maximum | 1319.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.3 KiB |
Quantile statistics
| Minimum | -436 |
|---|---|
| 5-th percentile | -108.9 |
| Q1 | 165.5 |
| median | 357 |
| Q3 | 567 |
| 95-th percentile | 886.5 |
| Maximum | 1319 |
| Range | 1755 |
| Interquartile range (IQR) | 401.5 |
Descriptive statistics
| Standard deviation | 301.8784477 |
|---|---|
| Coefficient of variation (CV) | 0.8157312 |
| Kurtosis | -0.04827371015 |
| Mean | 370.0709838 |
| Median Absolute Deviation (MAD) | 199 |
| Skewness | 0.2079262984 |
| Sum | 297167 |
| Variance | 91130.5972 |
| Value | Count | Frequency (%) | |
| 375 | 5 | 0.6% | |
| 58 | 4 | 0.5% | |
| 297 | 4 | 0.5% | |
| 231 | 4 | 0.5% | |
| 16 | 4 | 0.5% | |
| 445 | 4 | 0.5% | |
| 255 | 3 | 0.4% | |
| 170 | 3 | 0.4% | |
| 568 | 3 | 0.4% | |
| -44 | 3 | 0.4% | |
| Other values (574) | 766 | 95.3% |
| Value | Count | Frequency (%) | |
| -436 | 1 | 0.1% | |
| -407 | 1 | 0.1% | |
| -386 | 1 | 0.1% | |
| -383 | 1 | 0.1% | |
| -379 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 1319 | 1 | 0.1% | |
| 1296 | 1 | 0.1% | |
| 1275 | 1 | 0.1% | |
| 1217 | 1 | 0.1% | |
| 1213 | 1 | 0.1% |
Term
Real number (ℝ≥0)
| Distinct count | 19 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.835616438356166 |
|---|---|
| Minimum | 9.0 |
| Maximum | 27.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.3 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 16 |
| median | 18 |
| Q3 | 20 |
| 95-th percentile | 23 |
| Maximum | 27 |
| Range | 18 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.266680075 |
|---|---|
| Coefficient of variation (CV) | 0.1831548737 |
| Kurtosis | -0.09089679072 |
| Mean | 17.83561644 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.02861842049 |
| Sum | 14322 |
| Variance | 10.67119872 |
| Value | Count | Frequency (%) | |
| 18 | 109 | 13.6% | |
| 17 | 98 | 12.2% | |
| 19 | 92 | 11.4% | |
| 16 | 73 | 9.1% | |
| 20 | 72 | 9.0% | |
| 21 | 69 | 8.6% | |
| 15 | 67 | 8.3% | |
| 14 | 49 | 6.1% | |
| 22 | 36 | 4.5% | |
| 13 | 34 | 4.2% | |
| Other values (9) | 104 | 12.9% |
| Value | Count | Frequency (%) | |
| 9 | 3 | 0.4% | |
| 10 | 7 | 0.9% | |
| 11 | 11 | 1.4% | |
| 12 | 21 | 2.6% | |
| 13 | 34 | 4.2% |
| Value | Count | Frequency (%) | |
| 27 | 3 | 0.4% | |
| 26 | 5 | 0.6% | |
| 25 | 13 | 1.6% | |
| 24 | 15 | 1.9% | |
| 23 | 26 | 3.2% |
Credit_score
Real number (ℝ≥0)
| Distinct count | 276 |
|---|---|
| Unique (%) | 34.4% |
| Missing | 2 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 759.5960099750623 |
|---|---|
| Minimum | 376.0 |
| Maximum | 1029.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.3 KiB |
Quantile statistics
| Minimum | 376 |
|---|---|
| 5-th percentile | 610 |
| Q1 | 725 |
| median | 770 |
| Q3 | 811 |
| 95-th percentile | 860 |
| Maximum | 1029 |
| Range | 653 |
| Interquartile range (IQR) | 86 |
Descriptive statistics
| Standard deviation | 76.49096547 |
|---|---|
| Coefficient of variation (CV) | 0.1006995356 |
| Kurtosis | 1.961721091 |
| Mean | 759.59601 |
| Median Absolute Deviation (MAD) | 43 |
| Skewness | -0.879983214 |
| Sum | 609196 |
| Variance | 5850.867799 |
| Value | Count | Frequency (%) | |
| 771 | 12 | 1.5% | |
| 773 | 10 | 1.2% | |
| 813 | 9 | 1.1% | |
| 766 | 9 | 1.1% | |
| 756 | 9 | 1.1% | |
| 799 | 9 | 1.1% | |
| 763 | 8 | 1.0% | |
| 815 | 8 | 1.0% | |
| 844 | 8 | 1.0% | |
| 772 | 8 | 1.0% | |
| Other values (266) | 712 | 88.6% |
| Value | Count | Frequency (%) | |
| 376 | 1 | 0.1% | |
| 451 | 1 | 0.1% | |
| 469 | 1 | 0.1% | |
| 512 | 2 | 0.2% | |
| 518 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 1029 | 1 | 0.1% | |
| 991 | 1 | 0.1% | |
| 974 | 1 | 0.1% | |
| 962 | 1 | 0.1% | |
| 941 | 1 | 0.1% |
Gender
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.3 KiB |
| Male | |
|---|---|
| Female |
| Value | Count | Frequency (%) | |
| Male | 518 | 64.4% | |
| Female | 286 | 35.6% |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.711442786 |
| Min length | 4 |
Marital_status
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.3 KiB |
| Single | |
|---|---|
| Married |
| Value | Count | Frequency (%) | |
| Single | 421 | 52.4% | |
| Married | 383 | 47.6% |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.476368159 |
| Min length | 6 |
Car_loan
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Memory size | 6.3 KiB |
| No | |
|---|---|
| Yes | |
| (Missing) | 1 |
| Value | Count | Frequency (%) | |
| No | 540 | 67.2% | |
| Yes | 263 | 32.7% | |
| (Missing) | 1 | 0.1% |
Personal_loan
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 3 |
| Missing (%) | 0.4% |
| Memory size | 6.3 KiB |
| Yes | |
|---|---|
| No | |
| (Missing) | 3 |
| Value | Count | Frequency (%) | |
| Yes | 408 | 50.7% | |
| No | 393 | 48.9% | |
| (Missing) | 3 | 0.4% |
Home_loan
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 2 |
| Missing (%) | 0.2% |
| Memory size | 6.3 KiB |
| No | |
|---|---|
| Yes | 42 |
| (Missing) | 2 |
| Value | Count | Frequency (%) | |
| No | 760 | 94.5% | |
| Yes | 42 | 5.2% | |
| (Missing) | 2 | 0.2% |
Emp_status
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.3 KiB |
| unemployed | |
|---|---|
| employed |
| Value | Count | Frequency (%) | |
| unemployed | 503 | 62.6% | |
| employed | 301 | 37.4% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.251243781 |
| Min length | 8 |
Amount
Real number (ℝ≥0)
| Distinct count | 573 |
|---|---|
| Unique (%) | 71.4% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1217.6301369863013 |
|---|---|
| Minimum | 244.0 |
| Maximum | 2362.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.3 KiB |
Quantile statistics
| Minimum | 244 |
|---|---|
| 5-th percentile | 717.2 |
| Q1 | 1007.5 |
| median | 1224 |
| Q3 | 1422 |
| 95-th percentile | 1719 |
| Maximum | 2362 |
| Range | 2118 |
| Interquartile range (IQR) | 414.5 |
Descriptive statistics
| Standard deviation | 308.279582 |
|---|---|
| Coefficient of variation (CV) | 0.2531799868 |
| Kurtosis | -0.008605283972 |
| Mean | 1217.630137 |
| Median Absolute Deviation (MAD) | 205 |
| Skewness | -0.02283736831 |
| Sum | 977757 |
| Variance | 95036.30069 |
| Value | Count | Frequency (%) | |
| 1062 | 5 | 0.6% | |
| 1085 | 5 | 0.6% | |
| 1281 | 4 | 0.5% | |
| 1286 | 4 | 0.5% | |
| 1146 | 4 | 0.5% | |
| 1429 | 4 | 0.5% | |
| 1162 | 4 | 0.5% | |
| 783 | 3 | 0.4% | |
| 1111 | 3 | 0.4% | |
| 1224 | 3 | 0.4% | |
| Other values (563) | 764 | 95.0% |
| Value | Count | Frequency (%) | |
| 244 | 1 | 0.1% | |
| 316 | 1 | 0.1% | |
| 385 | 1 | 0.1% | |
| 386 | 1 | 0.1% | |
| 395 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 2362 | 1 | 0.1% | |
| 2066 | 1 | 0.1% | |
| 2052 | 1 | 0.1% | |
| 1983 | 1 | 0.1% | |
| 1982 | 1 | 0.1% |
Saving_amount
Real number (ℝ≥0)
| Distinct count | 594 |
|---|---|
| Unique (%) | 73.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3174.814676616915 |
|---|---|
| Minimum | 2082 |
| Maximum | 4108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.3 KiB |
Quantile statistics
| Minimum | 2082 |
|---|---|
| 5-th percentile | 2615.15 |
| Q1 | 2948.75 |
| median | 3200 |
| Q3 | 3395 |
| 95-th percentile | 3715.55 |
| Maximum | 4108 |
| Range | 2026 |
| Interquartile range (IQR) | 446.25 |
Descriptive statistics
| Standard deviation | 340.2855686 |
|---|---|
| Coefficient of variation (CV) | 0.1071828133 |
| Kurtosis | -0.08647134084 |
| Mean | 3174.814677 |
| Median Absolute Deviation (MAD) | 227 |
| Skewness | -0.1026719465 |
| Sum | 2552551 |
| Variance | 115794.2682 |
| Value | Count | Frequency (%) | |
| 3391 | 4 | 0.5% | |
| 3183 | 4 | 0.5% | |
| 2929 | 4 | 0.5% | |
| 3459 | 4 | 0.5% | |
| 3273 | 4 | 0.5% | |
| 3282 | 4 | 0.5% | |
| 3384 | 4 | 0.5% | |
| 3199 | 3 | 0.4% | |
| 3237 | 3 | 0.4% | |
| 3145 | 3 | 0.4% | |
| Other values (584) | 767 | 95.4% |
| Value | Count | Frequency (%) | |
| 2082 | 1 | 0.1% | |
| 2145 | 1 | 0.1% | |
| 2191 | 2 | 0.2% | |
| 2290 | 1 | 0.1% | |
| 2350 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 4108 | 1 | 0.1% | |
| 4044 | 1 | 0.1% | |
| 4022 | 1 | 0.1% | |
| 4021 | 1 | 0.1% | |
| 4014 | 1 | 0.1% |
| Distinct count | 121 |
|---|---|
| Unique (%) | 15.1% |
| Missing | 3 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.561797752808985 |
|---|---|
| Minimum | 0.0 |
| Maximum | 120.0 |
| Zeros | 30 |
| Zeros (%) | 3.7% |
| Memory size | 6.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 14 |
| median | 39 |
| Q3 | 79 |
| 95-th percentile | 114 |
| Maximum | 120 |
| Range | 120 |
| Interquartile range (IQR) | 65 |
Descriptive statistics
| Standard deviation | 37.13992446 |
|---|---|
| Coefficient of variation (CV) | 0.7808772211 |
| Kurtosis | -1.095347249 |
| Mean | 47.56179775 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | 0.4758560852 |
| Sum | 38097 |
| Variance | 1379.373989 |
| Value | Count | Frequency (%) | |
| 0 | 30 | 3.7% | |
| 10 | 17 | 2.1% | |
| 5 | 17 | 2.1% | |
| 6 | 15 | 1.9% | |
| 1 | 15 | 1.9% | |
| 42 | 14 | 1.7% | |
| 11 | 14 | 1.7% | |
| 21 | 14 | 1.7% | |
| 22 | 13 | 1.6% | |
| 12 | 13 | 1.6% | |
| Other values (111) | 639 | 79.5% |
| Value | Count | Frequency (%) | |
| 0 | 30 | 3.7% | |
| 1 | 15 | 1.9% | |
| 2 | 11 | 1.4% | |
| 3 | 12 | 1.5% | |
| 4 | 12 | 1.5% |
| Value | Count | Frequency (%) | |
| 120 | 7 | 0.9% | |
| 119 | 7 | 0.9% | |
| 118 | 5 | 0.6% | |
| 117 | 6 | 0.7% | |
| 116 | 6 | 0.7% |
Age
Real number (ℝ≥0)
| Distinct count | 25 |
|---|---|
| Unique (%) | 3.1% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.273972602739725 |
|---|---|
| Minimum | 18.0 |
| Maximum | 42.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.3 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 29 |
| median | 32 |
| Q3 | 34 |
| 95-th percentile | 38 |
| Maximum | 42 |
| Range | 24 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.096060699 |
|---|---|
| Coefficient of variation (CV) | 0.1309734696 |
| Kurtosis | -0.1453879945 |
| Mean | 31.2739726 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.2347842426 |
| Sum | 25113 |
| Variance | 16.77771325 |
| Value | Count | Frequency (%) | |
| 32 | 95 | 11.8% | |
| 33 | 76 | 9.5% | |
| 30 | 71 | 8.8% | |
| 31 | 67 | 8.3% | |
| 29 | 63 | 7.8% | |
| 35 | 60 | 7.5% | |
| 34 | 55 | 6.8% | |
| 28 | 48 | 6.0% | |
| 36 | 47 | 5.8% | |
| 27 | 42 | 5.2% | |
| Other values (15) | 179 | 22.3% |
| Value | Count | Frequency (%) | |
| 18 | 1 | 0.1% | |
| 19 | 1 | 0.1% | |
| 20 | 3 | 0.4% | |
| 21 | 3 | 0.4% | |
| 22 | 8 | 1.0% |
| Value | Count | Frequency (%) | |
| 42 | 2 | 0.2% | |
| 41 | 3 | 0.4% | |
| 40 | 4 | 0.5% | |
| 39 | 15 | 1.9% | |
| 38 | 22 | 2.7% |
No_of_credit_acc
Real number (ℝ≥0)
| Distinct count | 9 |
|---|---|
| Unique (%) | 1.1% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.4022415940224158 |
|---|---|
| Minimum | 1.0 |
| Maximum | 9.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.536373603 |
|---|---|
| Coefficient of variation (CV) | 0.6395583219 |
| Kurtosis | 2.748836272 |
| Mean | 2.402241594 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.498095834 |
| Sum | 1929 |
| Variance | 2.360443847 |
| Value | Count | Frequency (%) | |
| 2 | 285 | 35.4% | |
| 1 | 261 | 32.5% | |
| 4 | 81 | 10.1% | |
| 5 | 79 | 9.8% | |
| 3 | 79 | 9.8% | |
| 9 | 8 | 1.0% | |
| 7 | 4 | 0.5% | |
| 8 | 3 | 0.4% | |
| 6 | 3 | 0.4% | |
| (Missing) | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 261 | 32.5% | |
| 2 | 285 | 35.4% | |
| 3 | 79 | 9.8% | |
| 4 | 81 | 10.1% | |
| 5 | 79 | 9.8% |
| Value | Count | Frequency (%) | |
| 9 | 8 | 1.0% | |
| 8 | 3 | 0.4% | |
| 7 | 4 | 0.5% | |
| 6 | 3 | 0.4% | |
| 5 | 79 | 9.8% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| ID | Default | Checking_amount | Term | Credit_score | Gender | Marital_status | Car_loan | Personal_loan | Home_loan | Education_loan | Emp_status | Amount | Saving_amount | Emp_duration | Age | No_of_credit_acc | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 101 | No | 988.0 | 15.0 | 796.0 | Female | Single | Yes | No | No | No | employed | 1536.0 | 3455 | 12.0 | 38.0 | 1.0 |
| 1 | 102 | No | 458.0 | 15.0 | 813.0 | Female | Single | Yes | No | No | No | employed | 947.0 | 3600 | 25.0 | 36.0 | 1.0 |
| 2 | 103 | No | 158.0 | 14.0 | 756.0 | Female | Single | No | Yes | No | No | employed | 1678.0 | 3093 | 43.0 | 34.0 | 1.0 |
| 3 | 104 | Yes | 300.0 | 25.0 | 737.0 | Female | Single | No | No | No | Yes | employed | 1804.0 | 2449 | 0.0 | 29.0 | 1.0 |
| 4 | 105 | Yes | 63.0 | 24.0 | 662.0 | Female | Single | No | No | No | Yes | unemployed | 1184.0 | 2867 | 4.0 | 30.0 | 1.0 |
| 5 | 106 | No | 1071.0 | 20.0 | 828.0 | Male | Married | Yes | No | No | No | employed | 475.0 | 3282 | 12.0 | 32.0 | 2.0 |
| 6 | 107 | No | -192.0 | 13.0 | 856.0 | Male | Single | Yes | No | No | No | employed | 626.0 | 3398 | 11.0 | 38.0 | 1.0 |
| 7 | 108 | No | 172.0 | 16.0 | 763.0 | Female | Single | Yes | No | No | No | employed | 1224.0 | 3022 | 12.0 | 36.0 | 1.0 |
| 8 | 109 | No | 585.0 | 20.0 | 778.0 | Female | Single | Yes | No | No | No | unemployed | 1162.0 | 3475 | 12.0 | 36.0 | 1.0 |
| 9 | 110 | Yes | 189.0 | 19.0 | 649.0 | Male | Married | Yes | No | No | No | employed | 786.0 | 2711 | 0.0 | 29.0 | 1.0 |
Last rows
| ID | Default | Checking_amount | Term | Credit_score | Gender | Marital_status | Car_loan | Personal_loan | Home_loan | Education_loan | Emp_status | Amount | Saving_amount | Emp_duration | Age | No_of_credit_acc | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 794 | 895 | No | 142.0 | 13.0 | 773.0 | Male | Married | Yes | No | No | 0 | unemployed | 1394.0 | 3134 | 19.0 | 30.0 | 2.0 |
| 795 | 896 | No | 145.0 | 21.0 | 792.0 | Male | Married | No | Yes | No | 0 | unemployed | 1183.0 | 2881 | 14.0 | 35.0 | 1.0 |
| 796 | 897 | No | 414.0 | 22.0 | 752.0 | Male | Married | No | Yes | No | 0 | unemployed | 1373.0 | 3165 | 42.0 | 30.0 | 2.0 |
| 797 | 898 | Yes | 85.0 | 20.0 | 843.0 | Male | Married | Yes | No | No | 0 | unemployed | 1078.0 | 3212 | 109.0 | 30.0 | 2.0 |
| 798 | 899 | Yes | -293.0 | 21.0 | 818.0 | Female | Single | Yes | No | No | 0 | unemployed | 1002.0 | 2983 | 0.0 | 29.0 | 2.0 |
| 799 | 900 | No | 393.0 | 18.0 | 846.0 | Female | Single | No | Yes | No | 0 | unemployed | 1603.0 | 3282 | 54.0 | 31.0 | 1.0 |
| 800 | 901 | No | 462.0 | 21.0 | 810.0 | Female | Single | Yes | No | No | 0 | unemployed | 1435.0 | 3873 | 110.0 | 32.0 | 1.0 |
| 801 | 902 | No | 717.0 | 17.0 | 739.0 | Male | Married | Yes | No | No | 0 | unemployed | 1669.0 | 3453 | 32.0 | 31.0 | 2.0 |
| 802 | 903 | No | 822.0 | 17.0 | 783.0 | Male | Married | No | Yes | No | 0 | unemployed | 1041.0 | 3312 | 43.0 | 34.0 | 2.0 |
| 803 | 904 | Yes | 512.0 | 18.0 | 601.0 | Male | Married | Yes | No | No | 0 | unemployed | 997.0 | 3060 | 104.0 | 26.0 | 1.0 |